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(54) Long-chain prenyl diphosphate synthase 

(57) The present invention discloses a mutated 
enzyme comprising a geranylgeranil diphosphate syn- 
thase having its origin in wild type Sulfolobus acido- 
caldarius wherein, one of at least phenylalanine at 



position 77, methionine at position 85, valine at position 
99, tyrosine at position 101, phenylalanine at position 
1 18, arginine at position 199 and aspartic acid at posi- 
tion 312 is substituted with another amino acid. 
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Description - — - 

BACKGROUND OF INVENTION 

1 . Field of Invention 

The present invention relates to a mutant prenyl diphosphate synthase that is able to synthesize prenyl diphos- 
phate having a longer chain than prenyl diphosphate synthesized by the native prenyl diphosphate synthase. 

2. Related Art 

Prenyl diphosphate is highly valuable in biosynthesis pathways, functioning as a precursor of steroids, a precursor 
of caratenoids, being a transition substrate of prenylated proteins, being a substrate for synthesis of vitamin E, vitamin 
K and ubiquinone (CoQ) and so forth. Prenyl diphosphate exists in various forms, including dimethylallyl diphosphate 
(DMAPP; C5), geranyl diphosphate (GPP; C10), farnesyl diphosphate (FPP; C15), geranylgeranyl diphosphate 
(GGPP; C20), geranylfarnesyl diphosphate (GFPP; C25), hexaprenyl diphosphate (HPP; C30), heptaprenyl diphos- 
phate (HepPP; C35) and octaprenyl diphosphate (OPP; C40). 

Prenyl transferases, which synthesize these prenyl diphosphates, are enzymes that form prenyl diphosphate by 
continuously condensing isopentenyl diphosphate (IPP; C5) into allylic diphosphate, and exist in various forms, includ- 
ing farnesyl diphosphate synthase (FPS). geranylgeranyl diphosphate synthase (GGPS). geranytfarnesyl diphosphate 
synthase (GFPS), hexaprenyl diphosphate synthase (HexPS), heptaprenyl diphosphate synthase (HepPS) and octa- 
prenyl diphosphate synthase (OPS). 

However, among the above-mentioned prenyl diphosphates, only those from dimethylallyl diphosphate having 5 
carbon atoms to geranyl diphosphate having 20 carbon atoms are commercially available in small amounts as rea- 
gents, and a process for industrially synthesizing and recovering large amounts of prenyl diphosphates having longer 
chains is not known. 

The carbon chain length and stereoisomerism of synthesized prenyl diphosphates are known to be specifically 
determined depending on the particular enzyme. Until now. it has not been clear what type of mechanism is the factor 
in determining carbon chain length. 

Although prenyl transferases and their genes are known to be derived from bacteria, mold, plants and animals, 
these enzyme are typically unstable, difficult to handle and are not expected to be industrially valuable. 

The prenyl transferases and their genes of thermophilic organisms, which are stable and easy to use as enzymes, 
are only farnesyl diphosphate synthase (FPS) (Koyama, T. et at. (1995) J. Biol. Chem. 1 13. 355-363) and heptaprenyl 
diphosphate synthase (HepPS) (Koike-Takeshita, A. et al. (1995) J. Biol. Chem. 270. 18396-18400) from the moder- 
ately thermophilic archaebacterium, Bacillus stearothermophilus: geranylgeranyl diphosphate synthase (GGPS) from 
the hyper thermophilic bacterium. Sulfolobus acidocaldarius (Ohnuma, S.-i. et al. (1994) J. Biol. Chem. 268. 14792- 
14797); as well as farnesyl diphosphate/geranylgeranyl diphosphate synthase (FPS/GGPS) from the methane-produc- 
ing archaebacterium, Methanobacterium thermoautotrophicum (Chen, A. and Poulter. CD. (1993) J. Biol. Chem. 268, 
1 1002-1 1007). Only HepPS can synthesize prenyl diphosphate having 35 carbon atoms, and enzymes having thermal 
stability that synthesize prenyl diphosphates having 25 or more carbon atoms have not been reported. In addition, the 
abovementioned HepPS does not have adequate heat resistance, is composed of two types of subunits, and handling 
is not always easy. 

SUMMARY OF INVENTION 

Thus, the present invention provides a thermostable prenyl diphosphate synthase capable of synthesizing long- 
chain prenyl diphosphate, a process for its production, and a method for using said enzyme. 

In order to create an enzyme that can synthesize prenyl diphosphate having a longer chain length, the inventors of 
the present invention succeeded in creating a mutant enzyme able to synthesize prenyl diphosphate having a longer 
chain than naturally-occurring geranylgeranyl diphosphate synthase by treating DNA coding for geranylgeranyl diphos- 
phate synthase with a mutation agent, introducing the above-mentioned treated DNA into the yeast, Saccharomyces 
cerevisiae. deficient for hexaprenyl diphosphate synthase activity, and selecting a mutant DNA that can complement the 
above-mentioned deficient, and moreover, elucidated the relationship between the mutation site in the enzyme and the 
chain length of the prenyl diphosphate that is formed, thereby leading to completion of the present invention. 

Thus, the present invention provides a mutant enzyme wherein, least one of phenylalanine residue at position 77, 
methionine residue at position 85, valine residue at position 99. tyrosine residue at position 101 , phenylalanine residue 
at position 118, Arginine residue at position 199 and aspartic acid residue at position 312 in a geranylgeranyl diphos- 
phate synthase of Sulfolobus acidocaldarius origin is substituted with another amino acid, and which enzyme can syn- 
thesize prenyl diphosphate having at least 25 carbon atoms. 
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Moreover, the present invention provides a gene system that codes for the above-mentioned enzyme, and a proc- 
ess for producing the above-mentioned enzyme using that gene system. 

Furthermore, the present invention provides a process for producing a mutant prenyl diphosphate synthase com- 
prising the steps of culturing a host transformed with a gene in which the codon for phenylalanine residue located at the 
fifth N-terminal side positron from the N-terminal amino acid of the aspartate-rich domain I in a gene that codes for the 
native enzyme, is converted to a codon for a non-aromatic amino acid, thereby enabling the expression of a mutant 
enzyme that is able to synthesize prenyl diphosphates having a longer chain than the longest chain of prenyl diphos- 
phate synthesized by the native prenyl diphosphate synthase. 

In addition, the present invention provides a process for producing long-chain prenyl diphosphate using the above- 
mentioned enzyme. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Fig. 1 indicates the mutation site of the present invention in the geranyl diphosphate synthase derived from Sulfolo- 
bus acidocaldarius . The arrows in the drawing indicate two aspartate-rich domains. 

Fig. 2 is photograph that indicates the autoradiograph of a thin layer chromatography which shows the products in 
the case of allowing the mutant enzymes of the present invention produced in yeast to act on substrates IPP and (all- 
E)-FPP. The ellipses show the positions of cold authentic samples, which are geraniol. farnesyl, and geranilgeranil for 
a, b and c respectively 

Fig. 3 is a photograph that indicates the autoradiograph of a thin layer chromatography which shows the products 
in the case of allowing the mutant enzyme of the present invention produced in yeast to act on substrates IPP and (all- 
E)-GGPP. The ellipses show the positions of cold authentic samples, which are geraniol, farnesyl, and geranilgeranil for 
a, b and c respectively 

Fig. 4 is a photograph that indicates the autoradiograph of a thin layer chromatography which shows the products 
in the case of allowing the mutant enzyme of the present invention produced in coli to act on (A) substrates IPP and 
DM APR, and on (B) substrates IPP and GPP. The ellipses show the positions of cold authentic samples, which are gera- 
niol, farnesyl. and geranilgeranil for a. b and c respectively. 

Fig. 5 is the autoradiograph of a photograph that indicates a thin layer chromatography which shows the products 
in the case of allowing the mutant enzyme of the present invention produced in E opJi to act on (A) substrates IPP and 
(all-E)-FPP, and on (B) substrates IPP and (all-E)-GGPP. The ellipses show the positions of cold authentic samples, 
which are geraniol. farnesyl, and geranilgeranil for a, b and c respectively. 

DETAILED DESCRIPTION 

As a specific example in the present invention, a geranylgeranyl diphosphate synthase (GGPS) gene of the hyper 
thermophilic archaebacterium, 

Sulfolobus acidocaldarius. is used for the starting material. The cloning method of this gene is described in detail in the 
specification of Japanese Patent Application No. 6-315572. In addition, another example for cloning the gene is 
described in the present specification as Example 1 , and a nucleotide sequence and an amino acid sequence encoded 
thereby are shown as SEQ ID NO: 1. 

In the present invention, a cloned DNA is mutated in vitro. Although chemical treatment using a mutagen, or phys- 
ical treatment using UV light or X-rays can be used for the mutation means, chemical treatment is convenient to carry 
out Any routinely used chemical mutagen can be used for the mutagenesis for the present invention, an example of 
which is nitrite. 

A specific example of mutagenesis is shown in Example 2. 

The mutagentzed DNA is inserted into a yeast expression vector to prepare a DNA library. Any vector that is able 
to express an inserted extraneous gene in the yeast can be used as an expression vector, examples of which include 
a yeast plasmid such as pYEUra3 (available from Clonetech) and pYES2 (available from Invitrogen). 

The resulting plasmid library is introduced into a yeast mutant strain defective for the ability to synthesize hexapre- 
nyl diphosphate, which is one of the precursors of coenzyme 06. Since this mutant strain is unable to synthesize coen- 
zyme 06 necessary for non-fermentative sugar metabolism, it cannot be grown in medium that contains glycerol as the 
sole carbon source. Thus, if the yeast transformed by the above-mentioned library is cultured in glycerol medium and 
the strains that grow are selected, strains can be selected that have acquired the ability to synthesize prenyl diphos- 
phate having a large number of carbon atoms for coenzyme Q synthesis. 

Five positive clones were obtained in this manner from approximately 1400 transfer mants. As a result of purifying 
the plasmids from these clones, determining the nucleotide sequence of the inserted fragment, and predicting amino 
acid sequences that are coded, each mutant had changes in the amino acid sequence as indicated below. 

Mutant 1 : Methionine at position 85 changed to isoleucine, arginine at position 199 changed to lysine, aspartic acid 
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at position 312 changed to Asn 

Mutant 2: Phenylalanine at position 1 18 changed to leucine 
Mutant 3: Phenylalanine at position 77 changed to serine 

Mutant 4: Phenylanine at position 77 changed to leucine and valine at position 99 changed to methionine 
Mutant 5: Phenylalanine at position 77 changed to serine and tyrosine at position 101 changed to histidine 

In contrast to wild-type enzymes being unable to synthesize prenyl diphosphate having at least 25 carbon atoms, 
enzymes having amino acid sequences containing these changes were able to synthesize prenyl diphosphate having 
25 or more carbon atoms Those amino acid sequences having the above-mentioned amino acid substitutions are 
shown in SEQ ID NOs: 2 to 6. 

Thus, it can be logically surmised that if an amino acid at any one of the above-mentioned positions is replaced with 
another amino acid, a prenyl diphosphate having more carbon atoms than that synthesized by the native enzyme can 
be synthesized. Thus, the present invention provides a mutant enzyme in which at least one amino acid from among 
phenylalanine at position 77, methionine at position 85, valine at position 99. tyrosine at position 101 , phenylalanine at 
position 118, arginine at position 199 and aspartic acid at position 312 is replaced with another amino acid, and said 
enzyme is able to synthesize prenyl diphosphate having at least 25 carbon atoms. 

Particularly in the case that phenylalanine at position 77 is replaced with another amino acid, and preferably a non- 
aromatic amino acid such as serine or leucine, that enzyme is able to synthesize prenyl diphosphate having at least 25 
carbon atoms. Thus, in one embodiment, the present invention provides an enzyme in which at least phenylalanine at 
position 77 is replaced with another amino acid such as serine, leucine or another non-aromatic amino acid This type 
at enzyme includes enzymes in which replaced amino acids are present at one or a plurality of the other above-men- 
tioned positions. Examples at other amino acid positions include valine at position 99 and/or tyrosine at position 101 . 

Thus, the present invention includes enzymes in which only phenylalanine at position 77 is replaced, enzymes in 
which phenylalanine at position 77 and valine at position 99 are replaced, enzymes in which phenylalanine at position 
77 and tyrosine at position 101 are replaced, enzymes in which phenylalanine at position 77. valine at position 99 and 
tyrosine at position 101 are replaced, and enzymes in which phenylalanine at position 77 and one or a plurality of amino 
acids at the above-mentioned positions are replaced. 

According to another mode of the present invention, an enzyme in which methionine at position 85, arginine at posi- 
tion 199 and aspartic acid at position 312 are replaced with other amino acids is also able to synthesize prenyl diphos- 
phate having at least 25 carbon atoms. Thus, the present invention, in another embodiment includes an enzyme in 
which at least methionine at position 85, arginine at position 199 and aspartic acid at position 312 are replaced with 
other amino acids. In this embodiment, enzymes in which methionine at position 85, arginine at position 199 and aspar- 
tic acid at position 312 are replaced, as well as enzymes containing amino acid replacements at one or a plurality of 
sites other than at these sites or the above-mentioned mutation sites, are included. 

According to still another embodiment of the present invention, an enzyme in which phenylalanine at position 1 18 
is replaced with another amino acid can also synthesize prenyl diphosphate having at least 25 carbon atoms. Thus, in 
another embodiment, the present invention includes enzymes in which at least the amino acid at position 1 18 is 
replaced with another amino acid. In this embodiment, enzymes in which the amino acid at position 11 8 is replaced with 
another amino acid, as well as enzymes containing amino acid replacements at one or a plurality of positions of the 
above-mentioned amino acid replacement positions, are included. 

Enzymes are known to have those own specificities of enzyme activities even in the case of being modified by addi- 
tion, removal and/or replacement of one or a few amino acids. Thus, in addition to the peptides having the amino acid 
sequences shown in SEQ ID NOs: 2 to 6, the present invention also includes enzymes that the same specificity while 
having an amino acid sequence that is changed by replacing, deleting and/or adding one or a few, such as up to 5 or 
up to 10, amino acids with respect to the amino acid sequences shown in SEQ ID Nos: 2 to 6. 

Two aspartate-rich domains (sites indicated with arrows in Fig. 1) are conserved in various prenyl transferases, and 
the diphosphate site of the substrate is thought to bind to these sites. Phenylalanine at position 77 exists at the 5th posi- 
tion upstream to the N-termina! side from the N-terminal of aspartate-rich domain I present on the N-terminal side 
among these two aspartate-rich domains. This phenylalanine is replaced with a non-aromatic amino acid in 3 of the 5 
mutants of the present invention. 

Thus, in order to synthesize prenyl diphosphate having a large number of carbon atoms, for example that having 
25 or more carbon atoms, if phenylalanine at about the fifth position upstream to the N-terminal side from the amino 
acid of the N-terminal of aspartate-rich domain I is replaced with another amino acid, for example a non-aromatic amino 
acid, even in the case of a prenyl transferase other than the prenyl transferase derived from Suffolobus acidocaldarius 
having the amino acid sequence indicated in Sequence No. 1 , an enzyme is obtained that is able to synthesize prenyl 
diphosphate having a larger number of carbon atoms than the wild type enzyme. 

Thus, the present invention provides a process for producing a mutant prenyl transferase characterized by replac- 
ing phenylalanine at the 5th position upstream to the N-terminal side from the amino acid of the N-terminal of aspartate- 
rich domain I of prenyl transferase. This amino acid replacement can be performed by changing the codon that codes 
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for that amino acid. 

In addition, the present invention provides a gene coding for the various above-mentioned mutant enzymes, a vec- 
tor comprising that gene, particularly an expression vector, and a host transformed with said vector. The gene (DNA) of 
the present invention can be easily obtained by introducing a mutation into DNA that codes for the wild type amino acid 
sequence indicated in SEQ ID NO: 1, according to routine methods such as site-directed mutagenesis or PCR. 

Moreover, once the amino acid sequence of the target enzyme has been determined, a suitable nucleotide 
sequence that codes for it can be determined, thus making the mutant is possible to chemically synthesize DNA by con- 
ventional DNA synthesis methods. 

In addition, the present invention provides an expression vector comprising the DNA as described above, hosts 
transformed with said expression vector, and a process for producing an enzyme or peptide of the present invention 
using these hosts. 

Although expression vectors contain an origin of replication, expression control sequence and so forth, these vary 
according to the host, Examples of hosts include procaryotes, examples of which include bacteria such as E. coli and 
Bacillus sp. including Bacillus subtilus : eucaryotes, examples of which include yeasts such as Saccharomyces sp. 
including & cerevisiae, and Pichia sp. including Pichia pastoris : molds, examples of which include Aspergillus sp. such 
as & oryzae and & niger : animal cells, examples of which include cultured cells and cultured cells of higher animals, 
such as CHO cells. In addition, it is also possible to use plants for the host. 

According to the present invention, as indicated in Examples, geranytfarnesyl diphosphate can be accumulated in 
the culture by curturing a host transformed by the DNA of the present invention, and geranylfarnesyl diphosphate can 
be produced by recovering it from the culture. Also according to the present invention, geranylfarnesyl diphosphate can 
be produced by allowing the mutant GGPP synthase produced according to the process of the present invention to act 
on the isopentenyl diphosphate substrate and each allylic substrate such as farnesyl diphosphate. 

In an example of using E. coli for the host, gene regulation of gene expression is known to exist such as in the proc- 
ess of transcribing mRNA from DNA and the process of translating protein from mRNA. In addition to those sequences 
present in nature (e.g. lac, trp, bla. Ipp, Pl. Pr, ter, T3 and T7 as promoters), sequences in which their mutants (e.g. 
lacUVS) are artificially joined with wild type promoter sequences (e.g. tac, trc) are known as examples of promoter 
sequences that regulate mRNA transcription, and these can also be used in the present invention. 

It is known that the ribosome binding site (GAGG and other similar sequences) sequence and the distance to the 
initiation codon are important as sequences that regulate the activity to translate the mRNA to synthesize proteins. In 
addition, it is also well known that the terminator, which commands termination of transcription on the 3'-end (e.g. a vec- 
tor containing rrnPT 1 T 2 is commercially available from Pharmacia), has an effect on protein synthesis efficiency in the 
recombinant. 

Although commercially available products can be used as is for the vector that can be used for preparation of the 
recombinant vector of the present invention, various types of vectors induced according to a specific purpose can also 
be used. Examples of these include pBR322, pBR327, pKK223-3, pKK233-2 and pTrc99, originating in pMB1 and hav- 
ing the replicon, pUC18, pUCl9, pUC118, pUC119, pBluescript, pHSG298 and pHSG396, modified to improve the 
number of copies, pACYC177 and pACYC184, derived from p15A and having the replicon, as well as plasmids origi- 
nating in pSC101, Co1E1 , R1 and F factor. Moreover, expression vectors, for fused proteins, that are easier to purify, 
can also be used, examples of which include 0GEX-2T, pGEX-3X and pMal-c2, and the example of a gene used as the 
starting material in the present invention is described in Japanese Patent Application No. 6-315572. 

In addition, gene introduction can also be performed by using virus vectors and transposons such as X-phages and 
M13 phages in addition to plasmids. In the case of gene introduction into a microorganism other than E coK. gene intro- 
duction into Bacillus sp. is known using puB1 10 (sold by Sigma) or pHY300PLK (sold by Takara Shuzo). These vectors 
are described in Molecular Cloning (J. Sambrook, E.F. Fritsch, T Maniatis ed., Cold Spring Harbor Laboratory Press, 
pub.), Cloning Vector (RH. Pouwels. B.E. Enger Valk. W.J. Brammar ed.. Elsevier pub.) and various company catalogs. 

Insertion of a DNA fragment coding for GGPP synthase and, as necessary, a DNA fragment having the function of 
regulating expression of the gene of the above-mentioned enzyme, into these vectors can be performed according to 
known methods using suitable restriction enzyme and ligase. Specific examples of plasmids of the invention prepared 
in this manner include pBS-GGPSmut1 , PBS-GGPSmut2, pBS-GGPSmut3. pBS-GGPSmut4 and pBS-GGPSmut5. 

Examples of microorganisms that can be used for gene introduction with this type of recombinant vector include E 
coli and Bacillus sp. This transformation can also be performed according to routine methods such as the CaCI 2 method 
or protoplast method described in Molecular Cloning (J. Sambrook, E.F. Fritsch, T Maniatis ed., Cold Spring Harbor 
Laboratory Press pub.) and DNA Cloning Vol. I-III (D.M. Glover ed., IRL Press pub.). 

In producing the mutant enzyme of the present invention, the above-mentioned transformed cell is cultured after 
which the mutant enzyme can be collected and purified from that culture in accordance with routine methods, examples 
of which include salting out, organic solvent sedimentation, gel filtration affinity chromatography, hydrophobic inter 
action chromatography and ion exchange chromatography. 

In addition, the present invention provides a process for producing prenyl diphosphate using the enzyme of the 
present invention. In this process, the enzyme of the present invention should be allowed to react in a medium, and par- 
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fcularly an aqueous medium, and then the target prenyl diphosphate should be re covered from the reaction medium 
as desired. The enzyme may not only be purified enzyme, but also crude enzymes obtained by semi -purification 
through various stages, or a substance containing enzymes such as cultured microorganisms or the culture itself. In 
addition, the above-mentioned enzyme, crude enzyme or enzyme-containing substance may be an immobilized 
enzyme that has been immobilized in accordance with conventional methods. 

Prenyl diphosphate having fewer carbon atoms than the target prenyl diphosphate, such as 5-20 carbon atoms and 
preferably less than 5 carbon atoms, and isopentyl diphosphate are used for the substrate. Water or an aqueous buffer, 
such as phosphate buffer, are used for the reaction medium. 

EXAMPLES 

The following Examples provide a more detailed explanation of the present invention. Furthermore, the materials 
used in the following Examples can all be easily acquired by a person with ordinary skill in the art as described below. 

Strain C296-LH3 of the budding yeast, Saccharomyces cerevisiae (Tzagoloff, A. and Dieckmann, C.L (1990) 
Microbiological Reviews 54, 211 -255, Tzagoloff, A. et al. (1075) J. Bacteriol. 122, 826-831), was used for the screening 
host. 

Plasmid pG3ATI (Tzagoloff, A. and Dieckmann, C.L (1990) Microbiological Reviews 54, 211-255, Tzagoloff A. etal. 
(1975) J. Bacteriol. 122, 826-831. Ashby, M.N. and Edwards, P. A. (1990) J. Biol. Chem. 265, 13157-13164) or plasmid 
YEpG3ASpH, from which portions other than the HexPS coding region had been removed from pG3/TI (Ashby, M.N. 
and Edwards. RA. (1990) J. Biol. Chem. 265. 13157-13164). was used for the positive control plasmid containing the 
HexPS gene. 

Y-PGK, wherein the crJE gene portion had been removed from Y-cilE (Misawa. N. et al. (1990) J. Bacteriology 1 72, 
6704-6712). was used for the expression vector for library preparation. Saccharomyces cerevisiae strain A451 was 
used as a wild strain used for the positive control. 

However, the experimental materials required for the present invention are not limited to those described above, but 
rather completely similar substitutes can also be used. 

Screening host mutant strain C2960-LH3 for screening is a deficient strain for the HexPS gene. In other words, a 
budding yeast HPS gene fragment can easily be obtained from a widely known wild strain of budding yeast by PCR 
using an already known budding yeast HexPS gene sequence (GenBank™/EMBL Data Bank accession number(s) 
J05547). If this gene fragment is then used by coupling with a yeast incorporating plasmid (Yip) such as pRS403, 
pRS404. pRS405 or pRS406 (available from Stratagene), an HexPS-deficient strain can easily be created by widely 
conducted gene destruction using homologous recombination. 

In addition, it also sufficient for the positive control plasmid if this gene fragment is inserted using a widely known 
budding yeast expression vector such as pYEUra3 (available from Clonetech) and pYES2 (available from Invitrogen). 
The strain used for the positive control is not limited to strain A451 , but rather any strain is sufficient provided it retains 
the wild HexPS gene. In addition, it is sufficient to use a commercially available vector for the expression vector for 
library preparation such as pYEYra3 available from Clonetech or pYES2 available from Invitrogen. 

LKC-18 reversed phase thin layer chromatography plates were purchased from Whatman Chemical Separation, 
Inc. [1- 14 C]IPP was purchased from Amersham. 

Example 1. Plasmid Construction 

New Hindll I restriction enzyme sites were introduced both upstream and downstream of the GGPS gene (Gen- 
Bank^/EMBL Data Bank accession number D28748) of Sulfolobus acidocaldarius bv PCR using the chemically syn- 
thesized DNA primers 5'-AAGAGAAGCTTATGAGTTACTTTG AC-3 ' (SEQ ID NO: 7) and 5'- 
GATACAAGCTTTATTTTCTCC-3' (SEQ ID NO: 8). Genomic DNA was purified in accordance with routine methods from 
Sulfolobus acidocaldarius. obtainable as ATCC33909 from the American Type Culture Collection (ATCC), and its clone 
DNA was then used for the template DNA of PCR. 

The DNA fragment amplified with PCR was ligated to the Hindi II site of plasmid pBluescript (KS + ) cleaved with Hin- 
dlll to form pBS-GGPS. A crtE gene portion was removed by cleaving plasmid Y-crtE with Hindlll, and the remaining 
portion containing the PGK promoter and PGK terminator was self-ligated to form Y-PGK. The insert portion containing 
GGPS gene obtained by severing pBS<3GPS with Hindlll was introduced at the Hindlll site of Y-PGK to form Y-GGPS. 

Example 2. Random Mutagenesis of GGPS Gene 

A random mutation was introduced into the region coding for GGPS gene using nitrite according to the method of 
Myers et al. (Myers. R.M. et al. (1985) Science 229. 242-247). Single strand DNA was isolated from E coji containing 
pBS-GGPS by infection with helper phage M13K07, and this was then treated for 60 minutes with 1 M sodium nitrite. 
Next, the complementary strand was synthesized as primer using chemical synthesis DNA S'-CCCCCCTCGAGGTC- 
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G ACGGTATCG ATAA-3 ' (SEQ ID NO: 9) corresponding to the sequence of the T7 promoter portion. The GGPS gene 
portion was then extracted with Hindllf restriction enzyme, introduced at the Hindlll site of Y-PGK, and transformed to 
E c_gj strain XU-Blue to prepare the library. 

s Example 3. Yeast Transformation and Screening 

The budding yeast, Saccharomvces cerevisiae . was transformed by the spheroptast method according to the 
method of Ashby etal. (Ashby, M.N. and Edwards, P. A (1990) J. Biol. Chem. 265, 13157-13164). Namely, HexPS-defi- 
cient strain C296-LH3 was transformed with the previously described plasmid library and cultured on leucine-deficient 
10 agar plate (leu" plate) using the top agar method (3% bactoagar, 0.67% yeast nitrogen base. 0.05% yeast extract, 
0.05% bacto peptone, 1 .0 M sorbitol and 2% glucose). 

The transformant having the Leu + phenotype was inoculated onto YEPG (1% yeast extract, 2% ethanol, 2% bacto 
peptone and 3% glycerol), D (1% yeast extract, 2% ethanol, 2% bacto peptone, 3% glycerol and 0.1% glucose) and 
YPD (1% yeast extract, 2% bacto peptone and 2% glucose) agar media followed by incubation for 3 days at 30°C. 
is Clones were selected from the C2960-LH3 transformants with plasmid containing mutated GGPS that grew on the 
YEPG agar plate, grew and formed colonies larger than those of non-transformed C296-LH3 on the D plate. 

This complemented phenotype is considered to indicate that the electron transport chain is active during the respi- 
ration reaction, or in other words, that a active coenzyme Q was synthesized in the C296-LH3 cells. Five clones having 
this complemented phenotype were obtained from 1,400 transformants. As a result of retesting the resulting five clones. 
20 not only were they able to grow on YEPG agar plates, but they also possessed the ability to form colonies that were 
clearly larger than those of YEpG3ASpH/C296-LH3, having a plasmid that contains HexPS gene of yeast origin, on D 
agar plates. The plasmid DNA of these five clones were purified in accordance with routine methods. 

These plasmids were named Y-GGPSmut1 , y-GGPSmut2, Y-GGPSmut3, Y-GGPSmut4 and Y-GGPSmut5. 

Furthermore, since yeast strain C296-LH3 is deficient in HexPS activity, it is unable to biosynthesize coenzyme Q6 
25 which has a hexaprenol group on its side chain. Since coenzyme Q6 is required for non-fermentative metabolism, 
C296-LH3 forms colonies on media containing a small amount of glucose that are smaller than those of the wild strain, 
and does not grow on media that only contains a non-fermentative substrate like glycerol for the carbon source. Prior 
to screening for mutated activity, the effects of expression in wild type GGPS derived from Sulfolobus acidocaldarius 
were investigated. 

30 On the D plates, strain Y-GGPS/C296-LH3, which is strain C296-LH5 having a plasmid containing the wild type 
GGPS gene, was found to clearly form colonies smaller than those of YEpG3ASpH/C296-LH3 although intermediate to 
YEpG3ASpH/C296-LH3, possessing a plasmid containing HexPS gene of yeast origin, and C296-LH3, not possessing 
a plasmid. However. Y-GGPS/C296- LH3 was unable to grow on the YEPG plate. This screening method was therefore 
confirmed to be useful. 

35 

Example 4. Determination of DNA Nucleotide Sequence and its Analysis 

The nucleotide sequences of DNA coding for the five mutant GGPS contained in the five types of purified plasmids 
were determined using the Perkin-Elmer Model 373A Fluorescent DNA Sequencer according to the dideoxy chain ter- 
40 mination method. Analysis of the nucleotide sequences was performed using the genetic data analysis software, Mac- 
MdlyTetra. 

The amino acid substitution sites as deduced from the nucleotide sequence of each mutant GGPS are shown in 
Fig. 1 . Replacement sites were found at the nucleotide sequence level for all selected mutants. In the case of Mutant 1 
which is the Y-GGPSmut1 insertion fragment, replacements were found consisting of mutant methionine at position 85 

45 changing to isoleucine. mutant arginine at position 199 changing to lysine, and mutant aspartic acid at position 312 
changing to asparagine. In the case of Mutant 2 which is the Y-GGPSmut2 insertion fragment, the only replacement 
was mutant phenylalanine at position 118 changing to leucine. In the case of Mutant 3 which is the Y-GGPSmut3 inser- 
tion fragment, mutant Phe at position 77 changed to serine, in the case of Mutant 4 which is the Y-GGPSmut4 insertion 
fragment, mutant phenylalanine at position 77 changed to leucine and mutant valine at position 99 changed to methio- 

so nine, and in the case of Mutant 5 which is the Y-GGPSmut5 insertion fragment, mutant phenylalanine at position 77 
changed to serine and mutant tyrosine at position 101 changed to histidine. 

A high proportion of these mutations consist of an aromatic amino acid residue being replaced with a non-aromatic 
amino acid residue. Phenylalanine at position 77 in particular has the most significant effect on the chain elongation 
reaction. Phenylalanine at position 77 is located at the five residues upstream from the N-terminal residue of an aspar- 

55 tate-rich domain I. There are two aspartate-rich domain motifs (DDXX(XX)D) that are conserved in prenyl transferase. 
The diphosphate portion of the substrates are believed to bind here. The amino acid residue located at the fifth position 
upstream from the N-terminal residue of this aspartate-rich domain, which was focused on for the first time in the 
present invention, is considered to be extremely important in determining the chain length of the reaction product. 
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Example 5 

A crude extract was prepared from the five selected clones (Y-GGPSmut1/C296-LH3, Y-GGPSmut2/C296-LH3, Y- 
GGPSmut3/C296-LH3, YGGPSmut4/C296-LH3 and Y-GGPSmut5/C-296-LH3) according to the method of Itoh et al. 
(Itoh, N. etal. (1984) J Biol. Chem. 259, 13923-13929). 

Namely, the above-mentioned yeast was incubated for 4 days at 30°C. Approximately 400 of cells were collected 
by centrifugation and washed once with 800 uJ of buffer A (50 mM Tris HCI pH 7.5, 5 mM MgCI 2 , 50 mM dithiothreitol, 
1 M sorbitol). The cells were then suspended in 1.2 mM butter B (50 mM Tris HCI pH 7.5, 5 mM MgCI 2 , 3 mM dithioth- 
reitol, 1 M sorbitol) followed by the addition of 0.8 mg of zymolyase and incubation for 1 hour at 30°C. 

The prepared spheroblasts were washed three times with buffer B and suspended in 1 ml of buffer C (50 mM Tris 
HCI pH 7.0, 10 mM 2-mercaptoethanol, 1 mM phenylmethanesulfonyi fluoride, 1 mM EDTA). Ultrasonic treatment was 
performed 10 times on the suspension in ice at two minute intervals, performing treatment for 10 seconds at a time at 
maximum output using a Branson Sonifier. The lysates were incubated for 1 hour at 55°C, and after inactivating prenyl 
transferase^) of the host cells, the lysates were centrrfuged for 10 minutes at 10,000 x g. The resulting supernatant was 
used as a mutant GGPS crude enzyme solution and assay of prenyl transferase activity. 

The results of performing an assay of prenyl transferase activity by LKC-18 thin layer chromatography using this 
mutant GGPS crude enzyme liquid prepared from yeast are shown in Figs. 2 and 3. 

After carrying out the enzyme reaction at 55°C, polyprenyl diphosphate was extracted with 1-butanol after which 
the 1-butanol was evaporated with a nitrogen gas flow. The resulting polyprenyl diphosphate was treated with acid 
phosphatase in accordance with the method of Fujii et al. (Fujii et al. (1982) Biochim. Biophys. Acta. 712, 716-718). The 
hydrolysis product was extracted with pentane and after performing thin layer chromatography using acetone/H 2 0 (9:1) 
for the developing solution, the distribution of radioactivity was analyzed with the Fuji Rim Model BAS2000 Bio-image 
Analyzer. The alcohols as the authentic standards, on which thin layer chromatography was performed simultaneously, 
followed by staining with iodine vapor (geranyol, farnesol, geranylgeraniol), were used to determine the developing loca- 
tions. 

Fig. 2 shows the result of reacting using [1 - 14 C]IPP and (all-E)-FPP for the substrates, while Fig. 3 shows the result 
of reacting using [1 - 14 C]IPP and (all-E)-GGPP for the substrates. Spots a through c correspond to the authentic stand- 
ard samples, a indicating geraniol, b indicating (all-E)-farnesol, and c indicating (atl-E)-geranylgeranyol. Ori indicates 
the sample-stopping point, S.F. indicates the solvent front. 

On the basis of these results, in the case of using GGPP for the allylic substrate, it was shown that each mutant 
GGPS is able to synthesize geranytfarnesyl diphosphate (GFPP) that is one isoprene unit longer than the reaction prod- 
uct of the wild type enzyme. On the other hand, the wild type GGPS is unable to synthesize the reaction product same 
as or longer than the chain length of GFPP at a level that allows detection. In the case of using FPP for the allylic sub- 
strate, the product ratio of GGPP/GFPP indicated by the mutant GGPS was different from each other. 

Example 6. Preparation of MutantGGPS from E. coli .. . . .-, - 

In order to ensure that the analysis is performed more accurately, each mutant GGPS was over expressed in E. coli 
strain of XL 1-Blue. Namely, each of the five plasmids Y-GGPSmut1 . Y-GGPSmut2, Y-GGPSmut3, Y-GGPSmut4 and Y- 
GGPSmutS obtained in screening was digested with Hindlll to obtain Hindlll DNA fragments that code tor the mutant 
GGPS. These Hindlll DNA fragments were inserted at the Hindlll site of the plasmid vector pBluescript (KS(+)) to obtain 
pBS-GGPSmut1, pBS-GGPSmut2, pBS-GGPSmut3, pBS-GGPSmut4 and pBS-GGPSmut5 respectively. 

Ecoji XL1-Blue was transformed with pBS-GGPSmut1 , pBS-GGPSmut2, pBS-GGPSmut3. pBS-GGPSmut4 and 
pBS-GGPSmut5 and cultured according to the method described in Molecular Cloning (Sambrook. J. et al. (1989) 
Molecular Cloning: A Laboratory Manual, 2nd Ed., Cold Spring Harbor Laboratory, Cold Spring Harbor, New Yort^. After 
collecting the bacterial cells, the bacterial cells were ultrasonically homogenized in 50 mM Tris HCI buffer containing 1 0 
mM 2-mercaptoethanol and 1 mM EDTA After heat treating the homogenate for 1 hour at 55°C, it was centrifuged for 
10 minutes at 100,000 x g. The supernatant was then collected as the crude enzyme solution which was assayed for 
PTase activity. 

Assay was performed by analysis of product with LKC-18 thin layer chromatography and by determination of 
enzyme activity. For thin layer chromatography, DMAPP, GPP, (all-E)-FPP and (all-E)-GGPP were used for the allylic 
substrates, and after reacting in the same manner as Example 5, LKC thin layer chromatography was performed in the 
same manner as Example 5. Those results are shown in Figs. 4 and 5. 

Fig. 4(A) is the result of reacting [1- 14 C]IPP with DMAPP for the substrate, and (B) is the result of reacting [1- 
14 QIPP with GPP for the substrate. Fig. 5(C) is a result of reacting [1 - 14 C]1PP with (all-E)-FPP for the substrate, while 
(D) is a result of reacting [1- 14 C]IPP with (all-E)-GGPP for the substrate. Ellipses a. through c show the positions of the 
authentic standard samples, a indicating geraniol. b indicating (all-E)-farnesol and c_ indicating (all-E)-geranylgeraniol. 
Ori indicates the sample-spoting point, while S.F. indicates the solvent front. 

The prenyl transferase activity was assaied as follows. 1 ml of assay mixture, containing 25 nmol of [1- 14 C]IPP (37 
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GBq/mol). 25 nmol of allylic substrate (DMAPP. GPP, (all-E)-FPP or (all-E)-GGPP). 5 jimol of MgCI 2 . 10 nmol of phos- 
phate buffer (pH 5.8) and the enzyme solution, was incubated for 60 minutes at 55°C. 

The reaction was stopped by cooling rapidly on ice. After adding 3.5 ml of water-saturated 1-butanol to the chilled 
mixture and shaking vigorously, the 1-butanol layer was washed with NaCI -saturated water and 14 C radioactivity was 
measured with a liquid scintillation counter. 1 unit of enzyme activity was defined as the amount for which 1 nmol of [1- 
14 C]IPP is incorporated into elongated prenyl diphosphate (polyprenyl diphosphate) that can be extracted with the 1- 
butanol layer. Those results are shown in the Table. 



Table 





ity (dpm) 


Product Distribution 






GPP 


FPP 


GGPP 


GFPP 


FFPP 






% 


% 


% 


% 


% 


Mutant 1 














DMAPP 


31,800 


23.2 


8.77 


29.6 


38.0 


0.45 


GPP 


5.260 


nd* 


38.8 


30.9 


30.4 


0.02 


FPP 


4,340 


nd* 


nd* 


65.1 


35.0 


nd* 


GGPP 


998 


nd* 


nd* 


nd* 


100 


nd* 


Mutant 2 














DMAPP 


15,800 


1.44 


0.66 


89.0 


8.86 


nd* 


GPP 


7,050 


nd* 


20.3 


74.9 


4.89 


nd* 


FPP 


6,080 


nd* 


nd* 


89.5 


10.5 


nd* 


GGPP 


379 


nd* 


nd* 


nd* 


100 


nd* 


Mutant 3 














DMAPP 


24,900 


3.40 


27.4 


16.6 


51.6 


0.92 


GPP 


9,890 


nd* 


64.7 


9.37 


24.5 


1.44 


FPP 


7,280 


nd* 


nd* 


30.4 


69.6 


nd* 


GGPP 


3,200 


nd* 


nd* 


nd* 


100 


nd* 


Mutant 4 














DMAPP 


16,700 


4.93 


4.07 


73.2 


17.8 


nd* 


GPP 


7,460 


nd* 


38.4 


51.3 


10.3 


nd* 


FPP 


5,650 


nd* 


nd* 


85.9 


14.1 


nd* 


GGPP 


551 


nd* 


nd* 


nd* 


100 


nd* 


Mutant 5 














DMAPP 


23,600 


27.1 


18.6 


12.8 


40.4 


1.12 


GPP 


9.070 


nd* 


59.3 


13.0 


26.1 


1.56 


FPP 


8,960 


nd* 


nd* 


32.0 


68.0 


nd* 


GGPP 


2,200 


nd* 


nd* 


nd* 


100 


nd* 


Wild type 














DMAPP 


13,600 


5.61 


0.43 


94.0 


nd* 


nd* 


GPP 


6,640 


nd* 


17.2 


82.8 


nd* 


nd* 


FPP 


4,650 


nd* 


nd* 


100 


nd* 


nd* 


GGPP 


nd* 


nd* 


nd* 


nd* 


nd* 


nd* 


nd: Not detected 
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Each mutant GGPS exhibited activity that synthesizes polyprenyl diphosphate having a longer chain length than 
GGPP. The wild type GGPS as well as each mutant GGPS reacted the best with OMAPP amongst the four allylic sub- 
strates. In addition, relative activity when allylic substrates were used that had a shorter chain length than FPP exhibited 
similar values. However, relative activity and product distribution when GGPP was used for the allylic substrate were 
considerably different. 

When DMAPP. GPP and FPP were used for the allylic substrates. Mutant 1, which is coded for by the insert DNA 
of ptasmid pBS-GGPSmutl, yielded the major reaction products of GFPP and GGPP. In particular, when DMAPP was 
used for the allylic substrate, only a slight amount of hexaprenyl diphosphate (HexPP) was detected as the reaction 
product. Although the distribution of reaction products varied between each allylic substrate, the proportion of product 
produced in one cycle of the condensation reaction was large. 

In the case of Mutant 2 coded for by the insert DNA of plasmid pBS-GGPSmut2, the major product was GGPP and 
the proportion of GFPP was roughly 10%. HexPP was not detected. 

Mutant 3, which is coded for by the insert DNA of plasmid pBS-GGPSmut3, and Mutant 5, which is coded for by the 
insert DNA of plasmid pBS-GGPSmutS, demonstrated similar characteristics. These mutants exhibited strong GFPP 
synthesis activity, while aiso synthesizing a small amount of HexPP. 

Mutant 4, which is coded for by the insert DNA of plasmid pBS-GGPSmut4, yielded GGPP as the major product, 
while the proportion of GFPP was roughly 15%. FPP was effectively synthesized when GPP was used for the allylic 
substrate. 
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SEQUENCE LISTING 

SEQ ID NO; 1 

Sequence Length: 993 

Sequence Type: Nucleic acid 

Strandness: Double strand 

Topology: Linear 

Molecular Type: Genomic DNA 

Source 

Organism: Su If globus *r.\ dncal darius 
Sequence 

ATC AGT TAC TTT GAC AAC TAT TTT AAT GAG ATT GTT AAT 
Met Ser Tyr Phe Asp Asn Tyr Phe Asn Glu He Val Asn 

5 10 
GAC ATT ATT AAG AGC TAT ATA TCT GGA GAT GTT OCT AAA 
Asp He IU Lys Ser Tyr He Ser Gly Asp Val Pro Lys 

20 25 
GCC TCA TAT CAT TTG TTT ACA TCT GGA GGT AAG AGG TTA 

* 

Ala Ser Tyr His Leu Phe Thr Ser Gly Gly Lys Arg Leu 
35 40 ^5 

ATC TTA ACT ATA TCA TCA GAT TTA TTC GGA GGA CAG AGA 
He Leu Thr He Ser Ser Asp Leu Phe Gly Gly Gin Arg 

50 55 60 

TAT TAT GCA GGT GCA GCT ATT GAA GTT CTT CAT ACT TTT 
Tyr Tyr Ala Gly Ala Ala He Glu Val Leu His Thr Phe 

65 70 75 

CAT GAT GAT ATT ATG GAT CAA GAT AAT ATC AGA AGA GGG 
His Asp Asp He Met Asp Gin Asp Asn He Arg Arg Gly 

85 90 
GTC CAC GTG AAA TAC GGC TTA CCC TTA GCA ATA TTA GCT 
Val His Val Lys Tyr Gly Leu Pro Leu Ala He Leu Ala 

100 105 
CTA CAT GCA AAG GCT TTT CaG CTC TTA ACC CAG GCT CTT 
Leu His Ala Lys Ala Phe Gin Leu Leu Thr Gin Ala Leu 
115 120 125 



TCT GTA 
Ser Val 
15 

CTA TAT 
Leu Tyr 

30 
AGA CCA 
Arg Pro 

GAA AGA 
Glu Arg 

ACG CTT 
Thr Leu 

TTA CCC 
Leu Pro 
95 

GGG GAT 
Gly Asp 
110 

AGA GGT 
Arg Gly 



AAC 
Asn 

GAA 
Glu 

TTA 
Leu 

GCT 
Ala 

GTG 
Val 
60 
ACA 
Thr 

TTA 
Leu 

TTG 
Leu 



48 



96 



192 



2C0 



288 



336 



36<* 
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CCA AGT CAA ACC ATA ATT AAG CCT TTC GAT ATT TTC ACT CGT TCA ATA 432 
Pro Ser Glu Thr lie He Lys Ala Phe Asp lie Phe Thr Arg Ser He 
130 135 14o 

ATA ATT ATA TCC CAA CCA CAC CCA GTA CAT ATG GAA TTT GAG CAC AGA 480 

He He He Ser Glu Gly Gin Ala V a l Asp Met Clu Phe Clu Asp Arg 

" 5 150 155 160 

ATT GAT ATA AAG GAG CAG GAA TAC CTT GAC ATG ATC TCA CGT AAG ACA 528 

He Asp He Lys Glu Gin Glu Tyr Leu Asp Met He Ser Arg Lys Thr 

165 170 175 

GCT CCA TTA TTC TCG GCA TCC TCA AGT ATA GGC GCA CTT ATT CCT GGT 576 
Ala Ala Leu Phe Ser Ala Ser Ser Scr He Gly Ala Leu He Ala Gly 

180 185 190 

GCT AAT GAT AAT GAT GTA AGA CTG ATG TCT GAT TTC GGT ACG AAT CTA 624 
Ala Asn Asp Asn Asp Val Arg Leu Met Ser Asp Phe Gly Thr Asn Leu 

195 200 205 

GGT ATT GCA TTT CAG ATT GTT GAC GAT ATC TTA GGT CTA ACA GCA GAC 6 72 
.Gly He Ala Phe Gin He Val Asp Asp He Leu Gly Leu Thr Ala Asp 

210 215 220 

GAA AAG GAA CTT GGA AAC CCT GTT TTT AGT GAT ATT ACG CAG CGT AAA 720 
Glu Lys Glu Leu Gly Lys Pro Val Phe Ser Asp He Arg Clu Gly Lys 
225 230 235 240 

AAG ACT ATA CTT GTA ATA AAA ACA CTG GAG CTT TGT AAA GAG GAC CAG 768 
Lys Thr He Leu Val He Lys Thr Leu Clu Leu Cys Lys Glu Asp Glu 

215 250 255 

AAG AAG ATT GTC XTA AACCCG 'TTA GGT AAT AAG TCA GCC TCA AAA GAA " 816 
Lys Lys He Val Leu Lys Ala Leu Gly Asn Lys Ser Ala Ser Lys Glu 

260 265 270 

GAA TTA ATG AGC TCa GCA GAT ATA ATT AAG AAA TAC TCT TTA GAT TAT 864 
Glu Leu Met Ser Ser Ala Asp He He Lys Lys Tyr Ser Leu Asp Tyr 

275 2« 0 285 

CCA TAC AAT TTA GCA GAG AAA TAT TAT AAA AAT GCT ATA GAC TCT TTA 912 
Ala Tyr Asn Leu Ala Glu Lys Tyr Tyr Lys Asn Ala He Asp Ser Leu 

290 295 300 

AAT CAA GTC TCC TCT AAG AGT GAT ATA CCT GGA AAG GCT TTA AAA TAT 960 
Asn Cln Val Ser Ser Lys Ser Asp lie Pro Gly Lys Ala Leu Lys Tyr 
305 no 315 3Z0 
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CIA CCT GAA TTT ACG ATa AGA aGG AGa AAA TAA 
Leu Ala Glu Phe Thr lie Arg Arg Arg Lys TER 

325 330 



SEQ ID NO: 2 
Sequence Length: 993 
Sequence Type: Nucleic acid 
Strandness: Double strand 
Topology: Linear 

Molecular Type: Mutated genomic DNA 
Sequence 

ATG AGT TAC TTT CaC AAC TAT TTT AAT GAG ATT GTT AAT TCT GTA AAC 
Met Ser Tyr Phe Asp Asn Tyr Phe Asn Glu lie Val Asn Ser Val Asn 

5 10 15 

GAC ATT ATT AAG AGC TAT ATA TCT GGA GAT GTT CCT AAA CTA TAT GAA 
Asp He lie Lys Ser Tyr He Ser Gly Asp Val Pro Lys Leu Tyr Glu 

20 25 30 

GCC TCA TAT CAT TTG TTT AC A TCT GGA GGT AAG AGG TTA AGA CCA TTA 
Ala Ser Tyr His Leu Phe Thr Ser Gly Gly Lys Arg Leu Arg Pro Leu 

35 40 45 

ATC TTA ACT ATA TCA TCA GAT TTA TTC GGA GGA CAG AGA GAA AGA GCT 
He Leu Thr lie Ser Ser Asp Leu Phe Gly Gly Gin Arg Glu Aeg Ala 

SO 55 60 

TAT TAT GCA GGT GCA GCT ATT GAA GTT CTT CAT ACT TTT ACG CTT GTG 
Tyr Tyr Ala Gly Ala Ala He Glu v a l Leu His Tht Phe Thr Leu Val 
65 70 75 80 

CAT GAT GAT ATT ATA GAT CAA CAT AAT ATC AGA AGA GGG TTA CCC ACA 
His Asp Asp He He Asp Gin Asp Asn He Arg Arg Gly Leu Pro Thr 

85 90 95 

GTC CAC CTG AAA TAC GGC TTA CCC TTA CCA ATA TTA GCT GGG GAT TTA 
Val His Val Lys Tyr Gly Leu Pro Leu Ala He Leu Ala Gly Asp Leu 

100 105 110 

CTA CAT GCA AAG GCT TTT CaG CTC TTA ACC CAG GCT CTT AGA GGT TTG 
Leu His Ala Lys Ala Phe Gin Leu Leu Thr Gin Ala Leu Arg Gly Leu 
115 120 125 
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CCA AGT 
Pro Ser 
130 
ATA ATT 
He He 
145 

ATT GAT 
He Asp 

CCT CCA 
Ala Ala 

GCT AAT 
Ala Asn 

GGT ATT 
Gly lie 
210 
GAA AAG 
Glu Lys 
22S 

AAG act 
Lys Thr 

AAC AAG 
Lys Lys 

GAA TTA 
Glu Leu 

GCA TAC 
Ala Tyr 
29D 
AAT CAA 
Asn Gin 
305 



GAA ACC 
Glu Thr 

ATA TCC 
He Ser 

ATA AAG 
He Lys 

TTA TTC 
Leu Phe 
180 
GAT AAT 
Asp Asn 
195 

GCA TTT 
Ala Phe 

GAA CTT 
Glu Leu 

ATA CTT 

He Leu 

ATT GTC 
He Val 
260 
ATC AGC 
Met Ser 
273 

AAT TTA 
Asn Leu 

GTC TCC 
Val Ser 



ATA ATT 
He He 

GAA GCA 
Glu Cly 
150 
GAG CAG 
Glu Gin 
165 

TCC GCA 
Ser Ala 

GAT GTA 
Asp Val 

CAG ATT 

Gin He 

CGA AAG 
Gly Lys 
230 
GTA ATA 
Val He 
245 

CTA AAG 
Leu Lys 



AAG GCT 
Lys Ala 

135 

CAG GCA 
Gin Ala 

GAA TAC 
Glu Tyr 

TCC TCA 
Ser Ser 

AAA CTG 
Lys Leu 
200 
GTT GAC 
Val Asp 
215 

CCT GTT 
Pro Val 

AAA ACA 
Lys Thr 

GCC TTA 
Ala Leu 



TCA CCA GAT ATA 
Ser Ala Asp He 
280 

GCA GAC AAA TAT 
Ala Glu Lys Tyr 
295 

TCT AAG AGT AAT 
Ser Lys Ser Asn 
310 



TTC GAT 
Phe Asp 

GTA GAT 
Val Asp 

CTT GAC 
Leu Asp 
170 
AG I ATA 
Ser He 
185 

ATG TCT 
Met Ser 

GAT ATC 
Asp He 

TTT AGT 
Phe Ser 

CTG GAG 
Leu Glu 
230 
GGT AAT 
Gly Asn 
265 

ATT AAG 
He Lys 



ATT TTC 
lie Phe 
140 
ATG CAA 
Met Glu 
155 

ATG ATC 
Met He 

GCC GCA 
Gly Ala 

GAT TTC 

Asp Phe 

TTA GGT 
Leu Cly 
220 
GAT ATT 
Asp He 
235 

CTT TGT 

Leu Cys 

AAG TCA 
Lys Ser 

AAA TAC 
Lys Tyr 



ACT CCT TCA ATA 
Thr Arg Ser He 



432 



TAT AAA AAT GCT 
Tyr Lys Asn Ala 
300 

ATA CCT GGA AAG 
He Pro Gly Lys 
315 



TTT GAG 
Phe Glu 

TCA CCT 
Ser Arg 

CTT ATT 
Leu He 
190 
GGT ACG 
Gly Thr 
205 

CTA ACA 
Leu Thr 

aGG GAG 
Arg Glu 

AAA GAG 
Lys Glu 

CCC TCA 
Ala Ser 
270 
TCT TTA 
Ser Leu 
285 

ATA GAC 
He Asp 



GAC AGA 
Asp Arg 
160 
AAG ACA 
Lys Thr 
175 

GCT GCT 
Ala Gly 

AAT CTA 
Asn Leu 

GCA GAC 
Ala Asp 

GGT AAA 
Cly Lys 
240 
GAC GAG 
Asp Glu 
255 

AAA GAA 
Lys Glu 

GAT TAT 
Asp Tyr 

TCT TTA 
Ser Leu 



CCT TTA AAA TAT 
Ala Leu Lys Tyr 
320 



480 



528 



576 



624 



672 



720 



768 



816 



B64 



912 



960 



55 



14 



EP 0 763 542 A2 



CTA CCT GAA TTT ACG ATA AGA AGG AGA AAA TAA 
Leu Ala Glu Phe Thr lie Arg Arg Acg Lys TER 

325 330 



SEQ ID NO: 3 
Sequence Length: 993 
Sequence Type: Nucleic acid 
Strandness: Double strand 
Topology: Linear 

Molecular Type: Mutated genomic DNA 
Sequence 

ATG AGT TAC TTT GAC AAC TAT TTT AAT GAG ATT GTT AAT TCT GTA AAC 
Met Ser Tyr Phe Asp Asn Tyr Phe Asn Glu lie Val Asn Ser Val Asn 

5 10 15 

GAC ATT ATT AAG AGC TAT ATA TCT GGA GAT GTT CCT AAA CTA TAT GAA 
Asp He He Lys Ser Tyr lie Ser Gly Asp Val Pro Lys Lea Tyr Glu 

20 25 30 

CCC TCA TAT CAT TTG TTT ACA TCT GGA GGT AAG AGG TTA ACA CCA TTA 
Ala Ser Tyr His Leu Phe Thr Ser Gly Gly Lys Arg Leu Arg Pro Leu 

35 40 * 5 

ATC TTA ACT ATA TCA TCA GAT TTA TTC GGA GGA CAG AGA GAA ACA GCT 
He Leu Thr lie Ser Ser Asp Leu Phe Gly Gly Gin Arg Glu Arg Ala 

50 55 60 

TAT TAT CCA GGT GCA GCT ATT GAA GTT CTT CAT ACT TTT ACG CTT GTG 
Tyr Tyr Ala Gly Ala Ala He Glu Val Leu His Thr Phe Thr Leu Val 
65 70 75 60 

CAT GAT GAT ATT ATG GAT CAA GAT AAT ATC AGA AGA GGG TTA CCC ACA 
His Asp Asp He Met Asp Gin Asp Asn He Arg Arg Gly Leu Pro Thr 

fi5 90 95 

GTC CAC GTG AAA TAC GGC TTA CCC TTA GCA ATA TTA CCT GGG GAT TTA 
Val His Val Lys Tyr Gly Leu Pro Leu Ala He Leu Ala Gly Asp Leu 

100 105 HO 

CTA CAT GCA AAG GCT CTT CAC CTC TTA ACC CAC GCT CTT AGA GGT TTG 
Leu His Ala Lys Ala Leu Gin Leu Leu Thr Gin Ala Leu Arg Gly Leu 
115 120 1^5 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



CCA AGT GAA ACC ATA ATT AAG CCT TTC CAT ATT TTC ACT CGT TCA ATA 432 
Pro Set Glu Thr He He Lys Ala Phe Asp He Phe Thr Arg Ser He 
130 135 1UQ 

ATA ATT ATA TCC GAA GGA CAG CCA GTA GAT ATG GAA TTT GAG GAC AGA 480 

He He He Ser Glu Gly Cln Ala Val Asp Met Glu Phe Glu Asp Arg 

145 150 155 160 

ATT GAT ATA AAG GAG CAG GAA TAC CTT GAC ATG ATC TCA CGT AAG ACA 528 

He Asp He Lys Glu Gin Glu Tyr Leu Asp Met He Ser Arg Lys Thr 

165 170 175 

GCT CCA TTA TTC TCG GCA TCC TCA AGT ATA CGC GCA CTT ATT GCT GGT 576 
Ala Al» Leu Phe Ser Ala Ser Ser Ser He Gly Ala Leu He Ala Gly 

180 lfl5 190 

GCT AAT GAT AAT GAT GTA AGA CTG ATG TCT GAT TTC GGT ACG AAT CTA 624 
Ala Asn Asp Asn Asp Val Arg Leu Met Ser Asp Phe Gly Thr Asn Leu 

193 200 205 

GGT ATT GCA TTT CAG ATT GTT GAG GAT ATC TTA GGT CTA ACA GCA GAC 672 
Gly He Ala Phe Gin He Val Asp Asp He Leu Gly Leu Thr Ala Asp 

210 215 220 

GAA AAG GAA CTT GGA AAG CCT GTT TTT AGT GAT ATT AGG CAG GGT AAA 720 
Glu Lys Glu Leu Gly Lys Pro Val Phe Ser Asp He Arg Glu Gly lys 
225 230 235 240 

AAG ACT ATA CTT GTA ATA AAA ACA CTG GAC CTT TGT AAA GAG GAC GAG 768 
Lys Thr He Leu Val He Lys Thr Leu Glu Leu Cys Lys Glu Asp Glu 

245 250 255 

AAG AAG ATTGTC CTAAAG GCG TTA GGT AAT AAG TCA GCC TCA AAA GAA 816 
Lys Lys He Val Leu Lys Ala Leu Gly Asn Lys Ser Ala Ser Lys Glu 

260 265 270 

GAA TTA ATG AGC TCA GCA CAT ATA ATT AAG AAA TAC TCT TTA CAT TAT 864 
Glu Leu Met Ser Ser Ala ASp He lie Lys Lys Tyr Ser Leu Asp Tyr 

275 280 285 

GCA TAC AAT TTA GCA GAG AAA TAT TAT AAA AAT CCT ATA GAC TCT TTA 912 
Ala Tyr Asn Leu Ala Glu Lys Tyr Tyr Lys Asn Ala He Asp Ser Leu 

290 295 300 

AAT CAA GTC TCC TCT AAG AGT GAT ATA CCT GGA AAG CCT TTA AAA TAT 9 60 
Asn Cln Val Ser Ser Lys Ser Asp He Pro Gly Lys Ala Leu Lys Tyr 
305 310 315 320 
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CTA GCT GAA TTT ACG ATA AGA AGG AGA AAA TAA 
Leu Ala Glu Phe Thr lie Arg Arg Arg Lys 

325 330 

SEQ ID NO: 4 
Sequence Length: 993 
Sequence Type: Nucleic acid 
Strandness: Double strand 
Topology: Linear 

Molecular Type: Mutated genomic DNA 
Sequence 

ATG AGT TAC TTT GAC AAC TAT TTT AAT GAG ATT GTT AAT TCT GTA AAC 
Met Ser Tyr Phe Asp Asn Tyr Phe Asn Glu He Val Asn Ser V a l Asn 

5 10 15 

GAC ATT ATT AAG AGC TAT ATA TCT GGA GAT GTT CCT AAA CTA TAT GAA 
Asp He He Lys Ser Tyr He Ser Gly Asp Val Pro Lys Leu Tyr Glu 

20 Z5 30 

GCC TCA TAT CAT TTG TTT ACA TCT GGA CGT AAG AGG TTA AGA CCA TTA 
Ala Ser Tyr His Leu Phe Thr Ser Gly Gly Lys Arg Leu Arg Pro Leu 

35 AO 45 

ATC TTA ACT ATA TCA TCA GAT TTA TTC GGA GGA CAG AGA GAA AGA GCT 
He Leu Thr He Ser Ser Asp Leu Phe Gly Gly Gin Arg Glu Arg Ala 

50 55 60 

TAT TAT GCa CCT GCA GCT ATT GAA GTT CTT CAT ACT TCT- ACG CTT CTG 
Tyr Tyr Ala Gly Ala AU He Glu Val Leu His Thr Ser Thr Leu Val 
65 70 75 80 

CAT GAT GAT ATT ATG GAT CAA GAT AAT ATC AGA AGA GGG TTA CCC ACA 
His Asp Asp He Met Asp Gin Asp Asn He Arg Arg Gly Leu Pro Thr 

85 90 95 

GTC CAC CTG AAA TAC CGC TTA CCC TTA GCA ATA TTA GCT GGG GAT TTA 
Val His Val Lys Tyr Gly Leu Pro Leu Ala He Leu Ala Gly Asp Leu 

100 105 110 

CTA CAT GCA AAG GCT TTT CAG CTC TTA ACC CAG GCT CTT AGA GGT TTG 
Leu His Ala Lys Ala Phe Gin Leu Leu Thr Gin Ala Leu Arg Gly Leu 
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CCA ACT GAA ACC ATA ATT AAG GCT TTC GAT ATT TTC ACT CGT TCA ATA 
Pro Ser Glu Thr He He Lys Ala Phe Asp He Phe Thr Arg Ser He 

130 135 140 

ATA ATT ATA TCC GAA GGA CAG CCA GTA GAT ATG GAA TTT GAG GAC AGA 
He He lie Ser Glu Cly Gin Ala Val Asp Met Glu Phe Glu Asp Arg 
145 150 155 160 

ATT GAT ATA AAG GAG CAG GAA TAC CTT GAC ATG ATC TCA CGT AAG ACA 
He Asp He Lys Glu Gin Glu Tyr Leu Asp Met He Ser Arg Lyg Thr 

165 170 175 

GCT GCA TTA TTC TCG GCA TCC TCA ACT ATA GGC GCA CTT ATT GCT GGT 
Ala Ala Leu Phe Ser Ala Ser Ser Ser He Gly Ala Leu He Ala Gly 

180 183 190 

GCT AAT GAT AAT GAT GTA ACA CTG ATG TCT GaT TTC GGT ACG AAT CTA 
Ala Asn Asp Asn Asp Val Arg Leu Met Ser Asp Phe Gly Thr Asn Leu 

195 200 205 

GGT ATT GCA TTT CAG ATT CTT GAC GAT ATC TTA GGT CTA ACA CCA GAC 
Gly He Ala Phe Gin He Val Asp Asp He Leu Cly Leu Thr Ala Asp 

210 215 220 

GAA AAG GAA CTT GGA AAC CCT GTT TTT AGT GAT ATT ACG GAG GGT AAA 
Glu Lys Glu Leu Gly Lys Pro Val Phe Ser Asp He Arg Glu Gly Lys 
225 230 235 240 

AAG ACT ATA CTT GTA ATA AAA ACA CTG GAG CTT TGT AAA GAG CaC GAG 
Lys Thr He Leu Val lie Lys Thr Leu Glu Leu Cys Lys Glu Asp Glu 

245 230 255 

AAG AAC ATT GTC CTA AAG GCG TTA GGT AAT AAG TCA GCC TCA AAA . GAA 
Lys Lys He Val Leu Lys Ala Leu Gly Asn Lys Ser Ala Ser Lys Glu 

260 265 270 

CAA TTA ATG AGC TCA GCA GAT ATA ATT AAG AAA TaC TCT TTA CAT TAT 
Glu Leu Met Ser Sec Ala Asp He He Lys Lys Tyr Ser Leu Asp Tyr 

275 280 285 

GCA TAC AAT TTA GCA CAG AAA TAT TAT AAA AAT GCT ATA GAC TCT TTA 
Ala Tyr Asn Leu Ala Glu Lys Tyr Tyr Lys Asn Ala He Asp Ser Leu 

290 295 300 

AAT CAA GTC TCC TCT AAG AGT GAT ATA CCT GGA AAG GCT TTA AAA TAT 
Asn Gin Val Ser Ser Lys Ser Asp He Pro Gly Lys Ala Leu Lys Tyr 
305 310 315 320 
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CTA GCT GAA TTT ACG ATA AGA AGG AGa AAA TAA 
Leu Ala Glu Phe Thr He Arg Arg Arg Lys T£R 

325 330 



SEQ ID NO: 5 
Sequence Length: 99 3 
Sequence Type: Nucleic acid 
Strandnessr Double strand 
Topology; Linear 

Molecular Type: Mutated genomic DNA 
Sequence 

ATG ACT TAC TTT CAC AAC TaT TTT AAT GAG ATT GTT AAT TCT GTA aAC *S 
Met Ser Tyr Phe Asp Asn Tyr Phe Asn Glu He Val Asn Ser Val Asn 

5 10 IS 

GAC ATT ATT AAG AGC TAT ATA TCT GGA CAT GTT CCT AAA CTA TAT GAA 96 
Asp Tie He Lys Ser Tyr He See Gly Asp Val Pro Lye Leu Tyr Glu 

20 25 30 

GCC TCA TAT CAT TTG TTT ACA TCT CGA GGT AAG AGG TTA AGA CCA TTA 144 
Ala Ser Tyr His Leu Phe Thr Ser Gly Gly Lys Arg Leu Arg Pro Leu 
30 35 40 45 

ATC TTA ACT ATA TCA TCA GAT TTA TTC GGA GGA CAG AGA GAA AGA GCT 192 
He Leu Thr He Ser Ser Asp Leu Phe Gly Gly Gin Arg Glu Arg Ala 

50 55 60 

TAT TAT GCA CGT GCA GCT ATT GAA GTT CTT CAT ACT CTT ACG CTT. GTG ;\ 240 
Tyr Tyr Ala Gly Ala Ala He Glu Val Leu His Thr Leu Thr Leu Val 
65 70 75 *0 

CAT GAT GAT ATT ATG GAT CAA GAT AAT ATC AGA AGA GGG TTA CCC ACA 288 
His Asp Asp He Met Asp Gin Asp Asn He Atg Arg Gly Leu Pro Thr 

85 90 9 5 

GTC CAC ATG AAA TAC GGC TTA CCC TTA GC A ATA TTA CCT GGG GAT TTA 336 
Val His Met Lys Tyr Gly Leu Pro Leu Ala lie Leu Ala Gly Asp Leu 

100 105 H° 

CTA CAT GCA AAG GCT TTT CaG CTC TTA ACC CAC GCT CTT AGA GGT TTG 364 
Leu His Ala Lys Ala Phe CLn Leu Leu Thr Gin Ala Leu Arg Gly Leu 
115 120 125 
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CCA AGT GAA ACC 
Pro Ser Glu Thr 
130 

ATA ATT ATA TCC 
lie lie He Ser 
145 

ATT GAT ATA AAG 
lie Asp He Lys 

GCT CCA TTA TTC 
Ala Ala Leu Phe 
180 

GCT AAT GAT AAT 
Ala Asn Asp Asn 
195 

GGT ATT GCA TTT 
Gly He Ala Phe 
210 

CAA AAG GAA CTT 
Glu Lys Glu Leu 
225 

AAG ACT ATA CTT 
Lys Thr He Leu 



50 



AAG AAG 
Lys Lys 

CAA TTA 
Glu Leu 

GCA TAC 
Ala Tyr 
290 
AAT CAA 

Asn Gin 
305 



ATT GTC 
lie Val 
260 
ATG AGC 
Met Ser 
275 

AAT TTA 

Asn Leu 

GTC TCC 
Val Ser 



ATA ATT 
He He 

GAA GCA 
Glu Gly 
150 
GAG CAG 
Glu Gin 
165 

TCG GCA 
Ser Ala 

GAT GTA 
Asp Val 

CAG ATT 
Gin He 

GGA AAG 
Gly Lys 
230 
GTA ATA 
Val He 
245 

CTA AAG 
Leu Lys 



AAG GCT 
Lys Ala 
135 

CAG GCA 
Gin Ala 

GAA TAC 
Glu Tyr 

TCC TCA 
Ser Ser 

AGA CTG 
Arg Leu 
200 
GTT CAC 
Val Asp 
215 

CCT GTT 
Pro Val 

AAA ACA 

Lys Thr 

CCG TTA 
Ala Leu 



TCA CCA GAT ATA 
Ser Ala Asp He 
280 

GCA GAG AAA TAT 
Ala Glu Lys Tyr 
295 

TCT AAG AGT GAT 
Ser Lys Ser Asp 
310 



TTC GaT 
Phe Asp 

GTA GAT 
Val Asp 

CTT GAC 
Leu Asp 
170 
ACT ATA 
Ser He 
185 

ATG TCT 
Met Ser 

GAT ATC 
Asp He 

TTT AGT 
Phe Ser 

CTG CAG 
Leu Glu 
250 
GGT AAT 
Gly Asn 
265 

ATT AAG 
He Lys 

TAT AAA 
Tyr Lys 

ATA CCT 
He Pro 



ATT TTC 

He Phe 
liO 
ATG GAA 
Met Glu 

155 

ATG ATC 
Met He 

GGC GCA 
Gly Ala 

GAT TTC 
Asp Phe 

TTA GGT 
Leu Gly 
220 
GAT ATT 
Asp He 
235 

CTT TGT 
Leu Cys 

AAG TCA 
Lys Ser 

AAA TAC 
Lys Tyr 

AAT GCT 
Asn Ala 
300 
GGA AAG 
Gly Lys 
315 



ACT CCT TCA ATA 
Thr Arg Ser He 



632 



TTT GAG 
Phe Glu 

TCA CGT 
Ser Arg 

CTT ATT 
Leu He 
190 
GGT ACG 
Gly Thr 
205 

CTA ACA 
Leu Thr 

ACG GAG 
Arg Glu 

AAA GAG 
Lys Glu 

GCC TCA 
Ala Ser 
270 
TCT TTA 
Ser Leu 
285 

ATA GAC 
He Asp 



GAC AGA 
Asp Arg 
160 
AAG ACA 
Lys Thr 
175 

GCT GGT 
Ala Gly 

AAT CTA 
Asn Leu 

GCA GAC 
Ala Asp 

GCT AAA 
Gly Lys 
240 
GAC GAG 
Asp Glu 
255 

AAA GAA 
Lys Glu 

GAT TAT 
Asp Tyr 

TCT TTA 
Ser Leu 



GCT TTA AAA TAT 
Ala Leu Lys Tyr 
320 



480 



526 



576 



624 



672 



720 



768 



516 



864 
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CTA CCT GAA TTT ACG ATA AGA AGG AGA AAA TAA 
Leu Ala Glu Phe Thr lie Arg Arg Arg Lys TER 

325 330 
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SEQ ID NO: 6 
Sequence Length: 9 93 
Sequence Type: Nucleic acid 
Strandness: Double strand 
Topology: Linear 

Molecular Type: Mutated genomic DNA 
Sequence 

ATG AGT TAC TTT GAC AAC TAT TTT AAT GAG ATT GTT AAT TCT GTA AAC i8 
M«t Ser Tyr Phe Asp Asn Tyr Phe Asn Glu He Val Asn Ser Val Asn 

5 10 15 

GAC ATT ATT AAC AGC TAT ATA TCT CGA GAT GTT CCT AAA CTA TAT GAA 96 
Asp He He Lys Ser Tyr He Ser Gly Asp Val Pro Lys Leu Tyr Glu 

20 25 30 

GCC TCA TAT CAT TTG TTT ACA TCT GGA GGT AAG AGG TTA AGA CCA TTA 144 
Ala Ser Tyr His Leu Phe Thr Ser Gly Gly Lys Arg Leu Arg Pro Leu 

35 40 05 

ATC TTA ACT ATA TCa TCA GAT TTA TTC GGA GGA CAG AGA GAA AGA GCT 192 
He Leu Thr He Ser Ser Asp Leu Phe Gly Gly Gin Arg Glu Arg Ala 

50 55 60 

TAT TAT GCA GGT GCA CCT ATT GAA GTT CTT CAT ACT TCT ACG CTT GTG . . 240 
Tyr Tyr Ala Gly Ala Ala He Glu Val Leu His Thr Ser Thr Leu Val 
65 70 75 80 

CAT GAT GAT ATT ATG GAT CAA GAT aAT ATC AGA AGA CGG TTA CCC ACA 283 
His Asp Asp He Met Asp Gin Asp Asn He Arg Arg Gly Leu Pro Thr 

85 90 95 

GTC CAC GTG AAA CAC GGC TTA CCC TTA GCA ATA TTA GCT GGG GAT TTA 336 
Val His Val Lys His Gly Leu Pro Leu Ala He Leu Ala Gly Asp Leu 

100 105 110 

CTA CAT GCA AAG GCT TTT CAG CTC TTA ACC CAG GCT CTT AGA GGT TTG 3S4 
50 Leu His Ala Lys Ala Phe Gin Leu Leu Thr Gin Ala Leu Arg Gly Leu 

115 120 125 



30 



35 



40 



45 



55 



EP 0 763 542 A2 



10 



15 



20 



25 



30 



35 



40 



45 



50 



CCA AGT 
Pro Ser 
130 
ATA ATT 
lie He 

ATT GAT 
He Asp 

CCT GCA 
Ala Ala 

GCT AAT 
Ala Asn 

GGT ATT 
Gly He 
210 
GAA AAG 
Glu Lys 
225 

AAC ACT 
Lys Thr 

AAC AAG 
Lys Lys 

GAA TTa 
Glu Leu 

GCA TAC 
Ala Tyr 
290 
AAT CAA 
Asn Gin 
305 



GAA ACC 
Glu Thr 

ATA TCC 
lie Ser 

ATA AAG 
He Lys 

TTA TTC 
Leu Phe 
180 
GAT AAT 
Asp Asn 
195 

GCA TTT 
Ala Phe 

GAA CTT 
Glu Leu 

ATA CTT 
He Leu 

ATT GTC 
He Val 
260 
ATG AGC 
Met Ser 
275 

AAT TTA 
Asn Leu 

GTC TCC 
Val Ser 



ATA ATT 

He He 

GAA GGA 
Glu Cly 
150 
GAG CAG 
Clu Gin 
165 

TCG GCA 
Ser Ala 

GAT GTA 

Asp Val 

CAG ATT 
Gin He 

GGA AAG 
Gly Lys 
230 
GTA ATA 
val He 
245 

CTA AAG 
Leu Lys 



AAG GCT 
Lys Ala 

135 

CAG GCA 
Gin Ala 

GAA TAC 
Glu Tyr 

TCC TCA 
Ser Ser 

AG A CTG 
Arg Leu 
200 
GTT GAC 
Val Asp 
215 

CCT GTT 
Pro Val 

AAA ACA 
Lys Thr 

CCG TTA 
Ala Leu 



TCA GCA GAT ATA 
Ser Ala Asp He 
2fl0 

CCA GAC AAA TaT 
Ala Glu Lys Tyr 
295 

TCT AAG ACT GAT 
Ser Lys Ser Asp 
310 



TTC GAT 
Phe Asp 

GTA GAT 
Val Asp 

CTT GAC 
Leu Asp 
170 
AGT ATA 
Ser He 
185 

ATG TCT 
Met Ser 

GAT ATC 
Asp He 

TTT AGT 
Phe Ser 

CTG GAG 
Leu Glu 
250 
GGT AAT 
Gly Asn 
265 

ATT AAC 
He Lys 

TAT AAA 
Tyr Lys 

ATA CCT 
He Pro 



ATT TTC ACT CGT 
He Phe Thr Arg 
140 

ATG GAA TTT GAG 
Met Glu Phe Glu 
155 

ATG ATC TCA CGT 
Het He Ser Arg 



GGC GCA 
Gly Ala 

GAT TTC 
Asp Phe 

TTA GGT 
Leu Gly 
220 
GAT ATT 
Asp He 
235 

CTT TCT 
Leu Cys 



CTT ATT 
Leu He 
190 
GGT ACG 
Gly Thr 
205 

CTA ACA 
Leu Thr 

AGC GAG 
Arg Glu 

AAA GAG 
Lys Glu 



AAG TCA GCC TCA 
Lys Ser Ala Ser 
270 

AAA TaC TCT TTA 
Lys Tyr Ser Leu 
28S 

AAT GCT ATA GAC 
Asn Ala He Asp 
300 

GGA AAG GCT TTA 
GLy Lys Ala Leu 
315 



TCA ATA 
Ser lie 

GAC AGA 
Asp Arg 
160 
AAG ACA 
Lys Thr 
175 

GCT CGT 
Ala Gly 

AAT CTA 
Asn Leu 

GCA GAC 
Ala Asp 

GGT AAA 
Gly Lys 
240 
GAC GAG 
Asp Glu 
255 

AAA CAA 
Lys Glu 

GAT TAT 
Asp Tyr 

TCT TTA 
Ser Leu 

AAA TAT 
Lys Tyr 
320 
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720 



768 
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CTA GCT GAA TTT ACG ATA AGA AGG AGA AAA TAA 993 
Leu Ala Glu Phe Thr He Arg Arg Arg Lys TEK 

325 330 

SEQ ID NO: 7 

Sequence Length: 26 

Sequence Type: Nucleic acid 

Strandness: Single strand 

Topology: Linear 

Molecular Type: Synthetic DNA 

4 

Sequence 

AAGAGAAGCT TATGAGTTAC TTTGAC 26 

SEQ ID NO: 8 

Sequence Length: 21 

Sequence Type: Nucleic acid 

Strandness: Single strand 

Topology: Linear 

Molecular Type: Synthetic DNA 

Sequence 

GATACAAGCT TTATTTTCTC C 21 

SEQ ID NO: 9 

Sequence Length: 28 

Sequence Type: Nucleic acid 

Strandness: Single strand 

Topology: Linear 

Molecular Type: Synthetic DNA 

Sequence 

CCCCCCTCGA GGTCGACGGT ATCGATAA 28 



The present invention discloses a mutated enzyme comprising a geranylgeranil diphosphate synthase having its 
origin in wild type Sulfolobus acidocaldarius wherein, one of at least phenylalanine at position 77, methionine at position 
85, valine at position 99, tyrosine at position 101, phenylalanine at position 118, arginine at position 199 and aspartic 
acid at position 31 2 is substituted with another amino acid. 

Claims 

1 . A mutated enzyme wherein at least one of phenylalanine at position 77, methionine at position 85, valine at position 
99, tyrosine at position 101, phenylalanine at position 1 1 8, arginine at position 1 99 and aspartic acid at position 31 2 
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in a geranylgeranil diphosphate synthase of Sulfolobus acidocaldarius origin, is replaced with another amino acid, 
which enzyme is able to form prenyl diphosphate having at least 25 carbon atoms, or a modified mutant enzyme 
that is modified by replacing, deleting and/or adding one to several amino acids, which enzyme maintains the activ- 
ity of the above-mentioned enzyme. 

2. An enzyme as set forth in claim 1 wherein at least phenylalanine at position 77 is substituted with another amino 
acid, 

3. An enzyme as set forth in claim 2 wherein said amino acid is a non-aromatic amino acid. 

4. An enzyme as set forth in claim 2 wherein phenylalanine at position 77 is substituted with a non-aromatic amino 
acid. 

5. An enzyme as set forth in either claim 2 or claim 3 wherein valine at position 99 is further substituted by another 
amino acid. 

6. An enzyme as set forth in either claim 2 or claim 3 wherein tyrosine at position 1 01 is further substituted by another 
amino acid. 

7. An enzyme as set forth in claim 1 wherein at least methionine at position 85. arginine at position 1 99. and aspartic 
acid at position 312 are substituted with other amino acids. 

8. An enzyme as set forth in claim 1 wherein at least phenylalanine at position 1 1 8 is substituted with another amino 
acid. 

9. A gene that codes for an enzyme as set forth in any of claims 1 through 8. 

10. An expression vector that contains a gene as set forth in claim 9. 

11 . A host transfected by an expression vector as set forth in claim 1 0. 

1 2. A process for production of an enzyme according to claim 1 1 in a process for producing an enzyme as set forth in 
any of claims 1 through 8. Claim 1. comprising the steps of 

culturing host cells transformed with an expression vector comprising a gene coding for the enzyme of claim 1 , 
and ... , 

recovering the enzyme. 

1 3. A process for production of a mutated prenyl diphosphate synthase comprising the step of: 

culturing host cells transformed with a gene mutated by substitution of a codon for the amino acid residue at 
fine upstream to the amino terminal of the aspartic acid-rich domain 1 with a codon for a non-aromatic amino 
acid residue so as to express the mutated prenyl diphosphate synthase which can produce longer chain of pre- 
nyl diphosphate than those produced by the original wild-type prenyl diphosphate synthase. 

1 4. A process for production of a prenyl diphosphate equal to or larger than those having 25 carbon atoms, comprising 
reacting an enzyme according to any one of claims 1 to 8 or an enzyme produced by a process according to claim 
12 or 13 with a substrate selected from isopentenyl diphosphate, dimethylallyl diphosphate, geranyl diphosphate, 
farnesil diphosphate and geranylgeranyl diphosphate. 
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(54) Long-chain prenyl diphosphate synthase 

(57) The present invention discloses a mutated 
enzyme comprising a geranylgeranil diphosphate syn- 
thase having its origin in wild type Sulfolobus acido- 
caldarius wherein, one of at least phenylalanine at 
position 77, methionine at position 85, valine at position 
99, tyrosine at position 101, phenylalanine at position 
118, arginine at position 199 and aspartic acid at posi- 
tion 312 is substituted with another amino acid. 
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